Multilabel Classification in Parallel Tasks
نویسنده
چکیده
In real world multilabel problems, it is often the case that e.g. documents are simultaneously classified with labels from multiple domains, such as genres in addition to topics. In practice, each of these problems is solved independently without taking advantage of possible label correlations between domains. Following the multi-task learning setting, in which multiple similar tasks are learned in parallel, we propose a global learning approach that jointly considers all domains. It is empirically demonstrated in this work that this approach is effective despite its simplicity when using a multilabel learner that takes label correlations into account.
منابع مشابه
Efficient Text Categorization Using a Min-Max Modular Support Vector Machine
The min-max modular support vector machine (M-SVM) has been proposed for solving large-scale and complex multiclass classification problems. In this paper, we apply the M-SVM to multilabel text categorization and introduce two task decomposition strategies into M-SVMs. A multilabel classification task can be split up into a set of two-class classification tasks. These two-class tasks are to dis...
متن کاملGraphical Feature Selection for Multilabel Classification Tasks
Multilabel was introduced as an extension of multi-class classification to cope with complex learning tasks in different application fields as text categorization, video o music tagging or bio-medical labeling of gene functions or diseases. The aim is to predict a set of classes (called labels in this context) instead of a single one. In this paper we deal with the problem of feature selection ...
متن کاملAdaptive Large Margin Training for Multilabel Classification
Multilabel classification is a central problem in many areas of data analysis, including text and multimedia categorization, where individual data objects need to be assigned multiple labels. A key challenge in these tasks is to learn a classifier that can properly exploit label correlations without requiring exponential enumeration of label subsets during training or testing. We investigate no...
متن کاملEfficient Pairwise Multilabel Classification for Large-Scale Problems in the Legal Domain
In this paper we applied multilabel classification algorithms to the EUR-Lex database of legal documents of the European Union. On this document collection, we studied three different multilabel classification problems, the largest being the categorization into the EUROVOC concept hierarchy with almost 4000 classes. We evaluated three algorithms: (i) the binary relevance approach which independ...
متن کاملAn Evaluation of Efficient Multilabel Classification Algorithms for Large-Scale Problems in the Legal Domain
In this paper we evaluate the performance of multilabel classification algorithms on two classification tasks related to documents of the EURLex database of legal documents of the European Union. It permits different settings of large-scale multilabel problems with up to 4000 classes with the same underlying documents. We compared the well known one-against-all approach (OAA) and its recently p...
متن کامل